Towards Zero-Shot Autonomous Inter-Task Mapping through Object-Oriented Task Description

نویسندگان

  • Felipe Leno da Silva
  • Helena Reali Costa
چکیده

The successful use of Reinforcement Learning in complex tasks depends on techniques to scale-up classical learning algorithms because they suffer from the curse of dimensionality. Transfer Learning approaches have been used to accelerate learning by reusing knowledge gathered from the solution of previous tasks. However, discovering how different tasks are related is a very complex undertaking if a human is not available (or is unable) to manually establish a mapping between tasks. We here propose an algorithm to autonomously estimate a Probabilistic Inter-TAsk Mapping (PITAM) across tasks described in an object-oriented manner, which requires less domain knowledge than a handcrafted Inter-Task Mapping. We also propose two strategies for Temporal-Difference algorithms to transfer knowledge using learned PITAMs. Our experiments evaluate varied scenarios in which the source and target tasks differ in several aspects, and our proposal presents benefits over both regular learning and Q-value Reuse using a detailed InterTask Mapping.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Supplementary Material: Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning

Inter/Extrapolation. In this experiment, a task is defined by three parameters: action, object, and number. The agent should repeat the same subtask for a given number of times. The agent is trained on all configurations of actions and target objects. However, only a subset of numbers is used during training. In order to interpolate and extrapolate, we define analogies based on simple arithmeti...

متن کامل

Using Task Descriptions in Lifelong Machine Learning for Improved Performance and Zero-Shot Transfer

Knowledge transfer between tasks can improve the performance of learned models, but requires an accurate estimate of the inter-task relationships to identify the relevant knowledge to transfer. These inter-task relationships are typically estimated based on training data for each task, which is inefficient in lifelong learning settings where the goal is to learn each consecutive task rapidly fr...

متن کامل

Audience awareness of Persian learners of English writing: Towards a model of task-oriented strategies

Persian  learners  of  English  often  avoid  attending  to  audience  considerations,  which  brings them  lower  scores.  The  present  study  was  conducted  in  a  major  university  in  Iran  to  help Persian learners develop a sense of audience awareness in writing. Thirty five Persian students of English  were trained  with a focus on process-oriented instruction. The intended task was a...

متن کامل

Using Task Features for Zero-Shot Knowledge Transfer in Lifelong Learning

Knowledge transfer between tasks can improve the performance of learned models, but requires an accurate estimate of the inter-task relationships to identify the relevant knowledge to transfer. These inter-task relationships are typically estimated based on training data for each task, which is inefficient in lifelong learning settings where the goal is to learn each consecutive task rapidly fr...

متن کامل

Autonomous Attentive Exploration in Search and Rescue Scenarios

In task-oriented exploration a robot has to direct its sight and delving towards the most promising regions of the environment, according to the task, in order to optimize its search. If the goal is dynamically set on the basis of what it is perceived, attention plays a crucial role, as it allows to combine fast glancing with accurate analysis, enabling the robot to quickly jump to conclusion b...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017